Listing of the Claims 

1 . (Currently Amended) A method of generating an index for a sequence that 
supports a non-contiguous subsequence match, comprising: 

receiving a sequence; 

receiving a window size; 

encoding the sequence into a weighted-sequence; 

encoding the weighted sequence into one or more one-dimensional sequences, 
wherein the length of each of the one or more one-dimensional sequences is less than 
the window size; 

inserting each of the one or more one-dimensional sequences as one or more 
trie nodes into a trie structure; and 

generating thean index, wherein generating the index comprises comprising : 

generating a current sequential ID and a maximum sequential ID pair 
for gen e rating each of the one or more trie nodes, wherein the current 
sequential ID of any descendant of a given trie node of the one or more trie 
nodes is between the current sequential ID of the given trie node and the 
maximum sequential ID; 

generating an iso-depth link for each unique symbol in each of the one 
or more one-dimensional sequences, wherein the iso-depth link comprises trie 
nodes under the symbol; and 

generating an offset list comprising an original position offer each of4he 
one or more subsequences in the weighted-sequence. 
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2. (Currently Amended) The method of claim 1, wherein encoding the sequence 
into athe weighted-sequence comprises encoding the sequence with weights 
represented by real numbers[;]. 

3. (Currently Amended) The method of claim 2, wherein encoding the sequence 
with weights represented by real numbers[,] comprises discretizing the sequence into a 
number of equi- width units. 

4. (Currently Amended) The method of claim 1 , wherein inserting each of the 
one or more one-dimensional sequences into athe trie structure compris e s is performed 
using a depth-first traversal. 

5. (Currently Amended) The method of claim 1, further comprising creating 
athe weighted-sequences index, wh e r e in th e w e ight e d s e qu e nc e s ind e x compris e s an 
iso d e pth ind e x, compris e s cr e ating th e w e ight e d s e quenc e s ind e x, wherein the 
weighted-sequences index comprises an iso-depth index, wherein the iso-depth index is 
a one-dimensional buffer. 

6. (Currently Amended) The method of claim 1 , further comprisin g wh e r e in 
creating athe weighted-sequences index, wh e r e in th e w e ight e d sequ e nces ind e x 
compris es an iso d e pth ind e x, compris e s cr e ating th e w e ight e d sequ e nc e s ind e x, 
wherein the weighted-sequences index comprises an iso-depth index, wherein the iso- 
depth index is a B + tree. 
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7. (Currently Amended) The method of claim 1, further comprisin g wh e r e in 
creating thea weighted-sequences index, wh e r e in the w e ight e d s e qu e nc e s ind e x 
comprises an iso d e pth ind e x, compris es cr e ating th e w e ighted s e qu e nc e s ind e x, 
wherein the weighted-sequences index comprises an iso-depth index, wherein the iso- 
depth index is a linked list. 

8. (Currently Amended) The method of claim 1, wherein receiving athe 
sequence comprises receiving one or more elements in the sequence, wherein each of 
the one or more elements are represented by one or more (symbol w e ight) pairs of 
symbol and weight elements . 

9. (Currently Amended) The method of claim 8, wh e r e in r e c e iving on e or more 
e l e m e nts in th e s e qu e nc e , wh e r e in e ach of the on e or mor e e l e m e nts ar e r e pr e s e nt e d by 
on e or mor e (symbol, w e ight) pairs, and wherein e ach of the symbol elements of th e 
on e or mor e (symbol, w e ight) pairs correspond to a non-uniform frequency 
distribution. 

10. (Currently Amended) The method of claim 9, further comprising reordering 
the one or more one one-dimensional sequences using the non-uniform frequency 
distribution to generate a new sequence p rior to inserting each of the one or more one- 
dimensional sequences into athe trie structur e using th e non uniform frequency 
di s tribution to gen e rat e a n e w sequ e nc e. 
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1 1 (Currently Amended) The method of claim 10, wherein reordering the one 
or more one one-dimensional sequences using the non-uniform frequency distribution 
to generate a new sequence p rior to inserting each of the one or more one-dimensional 
sequences into athe trie structure using th e non uniform fr e qu e ncy distribution to 
g e nerat e on e or mor e n e w se qu e nc e s , comprises: 

(a) adding an offset 2*w*r to each of the weight elements of th e on e or mor e 
on e dim e nsional s e quenc e s to generate a new weight , wherein w is a window size, mid 
r is a frequency rank for a symbol of each of the symbol elements to g e nerat e a n e w 
w e ight ; 

(b) sorting the pairs of symbol and weight elements th e e ach e l e m e nt of th e on e 
or mor e on e dim e nsional s e qu e nc e s by the new weight; 

(c) placing a moving window of size 2*w*A on the on e or mor e new 
sequences, wherein A is a frequenc yt h e total numb e r of the symbols; and 

(d) indexing the on e or mor e new sequences in a new window. 

12. (Currently Amended) The method of claim 1, wherein receiving athe 
sequence comprises receiving one or more scientific datasets, transforming each of the 
one or more scientific datasets into one or more sequences, and concatenating the one 
or more sequences to form a long sequence. 

13. (Cancelled) 

14. (Cancelled) 
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1 5. (Currently Amended) A program storage device readable by machine, 
tangibly embodying a program of instructions executable by the machine to perform 
instructions executable by the machine m achin e r e adable m e dium having instructions 
stor e d th e r e on for e x e cution by a processor to perform a method steps of generating an 
index for a sequence that supports a non-contiguous subsequence match, the method 
steps comprisin g th e st e ps of : 

receiving a sequence; 

receiving a window size; 

encoding the sequence into a weighted-sequence; 

encoding the weighted sequence into one or more one-dimensional sequences, 
wherein the length of each of the one or more one-dimensional sequences is less than 
the window size; 

inserting each of the one or more one-dimensional sequences as one or more 
trie nodes into a trie structure; and 

generated an index, wherein generating the index comprises cr e ating th e ind e x 
comprising : 

generating a current sequential ID and a maximum sequential ID pair 
for each of the one or more trie nodes, wherein the current sequential ID of any 
descendant of a given trie node of the one or more trie nodes is between the 
current sequential ID of the given trie node and the maximum sequential ID; 

generating an iso-depth link for each unique symbol in each of the one 
or more one-dimensional sequences, wherein the iso-depth link comprises trie 
nodes under the symbol; and 
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' generating an offset list comprising an original position foref each of-the 
one or more subsequences in the weighted-sequence. 

16. (Cancelled) 
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